# 224x224 Resolution

Pvt Medium 224
Apache-2.0
PVT is a Transformer-based vision model that employs a pyramid structure for image processing, pre-trained on ImageNet-1K, suitable for image classification tasks.
Image Classification Transformers
P
Xrenya
13
0
Convnext Tiny Finetuned Cifar10
Apache-2.0
This model is a tiny version based on the ConvNeXT architecture, fine-tuned on the CIFAR10 dataset, suitable for image classification tasks.
Image Classification Transformers
C
ahsanjavid
2,014
1
Levit 128S
Apache-2.0
LeViT-128S is a vision Transformer model pretrained on the ImageNet-1k dataset, combining the advantages of convolutional networks for faster inference.
Image Classification Transformers
L
facebook
3,198
4
Levit 384
Apache-2.0
LeViT-384 is a vision Transformer model pre-trained on the ImageNet-1k dataset, combining the advantages of convolutional networks for faster inference speed.
Image Classification Transformers
L
facebook
37
0
Resnet 152
Apache-2.0
A deep residual network model pre-trained on the ImageNet-1k dataset for image classification tasks
Image Classification Transformers
R
microsoft
18.22k
12
Beit Large Patch16 224 Pt22k Ft22k
Apache-2.0
BEiT is a Vision Transformer (ViT)-based image classification model, pre-trained in a self-supervised manner on ImageNet-22k and fine-tuned on the same dataset.
Image Classification
B
microsoft
1,880
5
Convnext Large 224
Apache-2.0
ConvNeXT is a pure convolutional model inspired by vision Transformers, trained on the ImageNet-1k dataset at 224x224 resolution.
Image Classification Transformers
C
facebook
740
27
Deit Base Distilled Patch16 224
Apache-2.0
The distilled version of the Efficient Data Image Transformer (DeiT) model was pre-trained and fine-tuned on ImageNet-1k at 224x224 resolution, extracting knowledge from a teacher model through distillation learning.
Image Classification Transformers
D
facebook
35.53k
26
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase